Evaluation Of Annotation Schemes For Japanese Discourse Japanese Discourse Tagging Working Group

نویسندگان

  • A. Ichikawa
  • Masahiro Araki
  • Y. Horiuchi
  • Masato Ishizaki
  • S. Itabashi
  • W. Itoh
  • Hideki Kashioka
  • K. Kato
  • H. Kikuchi
  • H. Koiso
  • T. Kumagai
  • A. Kurematsu
  • Kikuo Maekawa
  • Shu Nakazato
  • M. Tamoto
  • S. Tutiya
  • Yoichi Yamashita
  • W. Yoshimura
چکیده

This paper describes standardizing discourse annotation schemes for Japanese and evaluates the reliability of these schemes. We propose three schemes, that is, utterance unit, discourse segment and discourse markers. These schemes have shown to be incrementally improved based on the experimental results, and the reliability of these schemes are estimated as "good" range.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Japanese Dialogue Corpus of Multi-Level Annotation

This paper describes a Japanese dialogue corpus annotated with multi-level information built by the Japanese Discourse Research Initiative, Japanese Society for Artificial Intelligence. The annotation information consists of speech, transcription delimited by slash units, prosodic, part of speech, dialogue acts and dialogue segmentation. In the project, we used the corpus for obtaining new find...

متن کامل

Annotation and Analysis of Discourse Relations, Temporal Relations and Multi-Layered Situational Relations in Japanese Texts

This paper proposes a methodology for building a specialized Japanese data set for recognizing temporal relations and discourse relations. In addition to temporal and discourse relations, multilayered situational relations that distinguish generic and specific states belonging to different layers in a discourse are annotated. Our methodology has been applied to 170 text fragments taken from Wik...

متن کامل

An Efficient Statistical Speech Act Type Tagging System for Speech Translation Systems

This paper describes a new efficient speech act type tagging system. This system covers the tasks of (1) segmenting a turn into the optimal number of speech act units (SA units), and (2) assigning a speech act type tag (SA tag) to each SA unit. Our method is based on a theoretically clear statistical model that integrates linguistic, acoustic and situational information. We report tagging exper...

متن کامل

The Automatic Translation of Discourse Structures

We empirically show that there are significant differences between the discourse structure of Japanese texts and the discourse structure of their corresponding English translations. To improve translation quality, we propose a computational model for rewriting discourse structures. When we train our model on a parallel corpus of manually built Japanese and English discourse structure trees, we ...

متن کامل

A Discourse Structure Analysis of Technical Japanese Texts and Its Implementation on the WWW*

This paper deals with a discourse structure analysis of technical Japanese texts for developing a Japanese writing Computer Assisted Language Learning (CALL) system whose goal is to assist students in learning to write technical Japanese texts. To analyze discourse structures of technical Japanese texts, cohesive expressions are used as cue words. The rules for analyzing texts are based on micr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999